Electronic device and control method thereof

ABSTRACT

An electronic device is disclosed. The electronic device comprises: a communicator for communicating with a server storing information on a plurality of short clips and storing keywords by the plurality of short clips; an outputter; an inputter; and a processor which, when a voice uttered by a user is received via the inputter, transmits a short clip request signal to the server, on the basis of a keyword included in the received uttered voice and information on content outputted from the outputter, and outputs a short clip via the outputter, on the basis of information on the short clip received from the server in response to the request signal.

TECHNICAL FIELD

The present disclosure pertains to an electronic device and acontrolling method thereof, and more particularly, to an electronicdevice which provides a short clip and a controlling method thereof.

BACKGROUND ART

Recently, various types of multimedia devices have been developed due tothe development of electronic technology. In particular, multimediadevices such as TVs, PCs, laptop computers, tablet PCs, smart phones andthe like are widely used in most households.

In addition, in order to meet the needs of users who desire variousfunctions, an effort is being made to develop a new personal assistantservice (Smart Assistant) that combines voice recognition with amultimedia device.

However, according to the related art, a text-based search result for auser's query is only provided as an unnatural voice using a TTS.

Further, when the video or audio content is included in the searchresult, the content is provided as an original. In this case, since theoriginal content includes a plurality of parts irrelevant to the user'squestion, there is a problem that the search result that is meaninglessto a user is accepted.

Therefore, there is a necessity that only parts related to a user'squery from the original content are provided as a search result.

DETAILED DESCRIPTION Tasks to be Solved

The object of the present disclosure is to provide an electronic devicewhich provides a short clip regarding an original content based on akeyword and a controlling method thereof.

Means for Solving Problems

The electronic device according to an exemplary embodiment includes acommunicator configured to communicate with a server for storinginformation on a plurality of short clips and keywords by the pluralityof short clips; an outputter; an inputter; and a processor configuredto: in response to a user's uttered voice being received through theinputter, transmit a short clip request signal to the server based on akeyword included in the received uttered voice and information on acontent output from the outputter, and output a short clip through theoutputter based on information on a short clip received from the serveraccording to the request signal.

The information on the plurality of short clips may include at least oneof information on a position in which the plurality of short clips arestored and time section including the keyword, and the processor, inresponse to information regarding a short clip being received from theserver according to the request signal, outputs the short clip based onthe received information.

Each of the plurality of short clips may be a video content or an audiocontent which is generated by editing a portion including a specifickeyword from a specific content.

The processor, in response to additional information regarding the shortclip being received, may provide additional information regarding theshort clip, and the additional information regarding the short clip mayinclude at least one of a title and a genre of an original content,broadcast time of the original content, generation time of the shortclip, broadcaster information of the original content, and the keyword.

The outputter may include at least one of a display and a speaker.

According to another exemplary embodiment, the outputter may includeonly a speaker, and the processor provides additional information on theshort clip as an audio through the speaker.

The outputter may include at least one of a display and a speaker, andthe processor additionally may transmit a short clip request signalrelated to the keyword to the server based on a keyword which isrepeated for a predetermined times or more during a predetermined timefrom audio that is output through the speaker.

The processor may provide additional response information on the utteredvoice along with the short clip based on a keyword included in thereceived uttered voice.

The processor may transmit the request signal which includes the keywordand user information to the server and receive a short clip relating tothe keyword and the user information from the server.

The processor, in response to the uttered voice being received, maytransmit the received uttered voice to the voice recognition server orthe server, and transmit a short clip request signal to the server basedon the keyword and information on the content received from the voicerecognition server or the server.

According to an exemplary embodiment, a controlling method of anelectronic device which communicates with a server for storinginformation on a plurality of short clips and keywords by the pluralityof short clips includes outputting a content; receiving a user's utteredvoice; in response to the uttered voice being received, transmitting ashort clip request signal to the server based on a keyword included inthe received uttered voice and information on a content output from theoutputter; and outputting a short clip through the outputter based oninformation on a short clip received from the server according to therequest signal.

The information on the plurality of short clips may include at least oneof information on a position in which the plurality of short clips arestored and time section including the keyword, and wherein theoutputting comprises, in response to information regarding a short clipbeing received from the server according to the request signal,outputting the short clip based on the received information.

Each of the plurality of short clips may be a video content or an audiocontent which is generated by editing a portion including a specifickeyword from a specific content.

The outputting the short clip may include, in response to additionalinformation regarding the short clip being received, providingadditional information regarding the short clip, and the additionalinformation regarding the short clip may include at least one of a titleand a genre of an original content, broadcast time of the originalcontent, generation time of the short clip, broadcaster information ofthe original content, and the keyword.

The outputting the short clip may include providing additionalinformation on the short clip as an audio through the speaker.

The electronic device includes at least one of a display and a speaker,and the transmitting may include additionally transmitting a short cliprequest signal related to the keyword to the server based on a keywordwhich is repeated for a predetermined times or more during apredetermined time from audio that is output through the speaker.

The outputting the short clip may include providing additional responseinformation on the uttered voice along with the short clip based on akeyword included in the received uttered voice.

The transmitting may include transmitting the request signal whichincludes the keyword and user information to the server and receiving ashort clip relating to the keyword and the user information from theserver and outputting the same.

The transmitting may include transmitting the received uttered voice tothe voice recognition server or the server, and transmitting a shortclip request signal to the server based on the keyword and informationon the content received from the voice recognition server or the server.

A system including the electronic device and a server according to anexemplary embodiment includes the electronic device which includes aserver that generates information on a plurality of short clips based oneach keyword of a plurality of original contents and stores informationon the generated plurality of short clips and keywords by the pluralityof short clips, and when a user's uttered voice is received, based onthe keywords included in the received uttered voice and information onthe content which the electronic device outputs, transmits a short cliprequest signal to the server, and outputs a short clip based oninformation on the short clip received form the server according to therequest signal.

Effect of Invention

According to various embodiments of the present disclosure as describedabove, a short clip for the original content is provided based on thekeyword included in the uttered voice of the user, so that the user'sconvenience can be increased.

DETAILED DESCRIPTION OF DRAWINGS

FIG. 1 is a view to describe a system to provide a short clip accordingto an exemplary embodiment.

FIGS. 2A and 2B are block diagrams illustrating a configuration of anelectronic device according to an exemplary embodiment.

FIG. 3 is a block diagram illustrating a configuration of a serveraccording to an exemplary embodiment.

FIG. 4 is a view to describe a method for outputting a short cliprelating to a keyword according to an exemplary embodiment.

FIG. 5 is a view to describe a method for outputting a short cliprelating to an output content according to an exemplary embodiment.

FIG. 6 is a view to describe a method for acquiring a keyword byanalyzing an audio signal according to an exemplary embodiment.

FIG. 7 is a view to describe additional information regarding a shortclip according to an exemplary embodiment.

FIG. 8 is a view to describe additional response information which isprovided along with a short clip according to an exemplary embodiment.

FIG. 9 is a flowchart to describe a method for providing a short clipaccording to an exemplary embodiment.

FIG. 10 is a flowchart to describe a system for providing a short clipaccording to an exemplary embodiment.

FIG. 11 is a view to describe a method that an electronic deviceprovides a short clip through a speaker according to another exemplaryembodiment.

BEST MODE OF THE INVENTION Detailed Description

Hereinafter, the present disclosure will be described in detail withreference to the drawings. In the following description of the presentdisclosure, detailed description of known functions and configurationsincorporated herein will be omitted when it may make the subject matterof the present invention rather unclear. In addition, the followingembodiments can be modified into various other forms, and the scope ofthe technical idea of the present disclosure is not limited to thefollowing examples. Rather, these embodiments are provided so that thisdisclosure will be more thorough and complete, and will fully convey thescope of the present disclosure to those skilled in the art.

Also, “comprising” means that other components may be included, ratherthan excluding other components, unless specifically stated otherwise.Further, various elements and regions in the drawings are schematicallydrawn. Accordingly, the spirit of the present disclosure is not limitedby the relative size or spacing depicted in the accompanying drawings.

FIG. 1 is a view to describe a system to provide a short clip accordingto an exemplary embodiment.

The electronic device 100 may be implemented as various types of devicesthat output content using at least one of a display and a speaker.Accordingly, the electronic device 100 may be implemented as a digitalTV, but the present disclosure is not limited thereto and may beimplemented as various types of devices having a display function suchas a PC, a mobile phone, a tablet PC, a PMP, a PDA, and a navigator. Inaddition, it is needless to say that the electronic device 100 may beimplemented as an audio output device without a display function. Inthis case, the content may be outputted as an audio signal through thespeaker. Hereinafter, for convenience of description, it is assumed thatthe electronic device 100 is implemented as a digital TV. An exemplaryembodiment in which the electronic device 100 includes only a speakerwithout a display function will be described in detail with reference toFIG. 10.

The electronic device 100 according to an embodiment of the presentdisclosure can receive the uttered voice of the user and obtain thekeyword included in the received uttered voice. Specifically, theelectronic device 100 can transmit the received uttered voice to a voicerecognition server (not shown), and receive the key word included in theuttered voice from the voice recognition server. However, the presentdisclosure is not limited thereto, and the electronic device 100 mayanalyze the user's uttered voice to obtain keywords.

The server 200 according to an exemplary embodiment may provide a shortclip and analyze the uttered voice and can be used as a voicerecognition server which transmits the keyword included in the utteredvoice to the electronic device 100.

The electronic device 100 may transmit a short clip request signal tothe server 200 based on the keyword included in the received utteredvoice and information on the content output by the electronic device100. In this case, the electronic device 100 can receive the informationabout the short clip corresponding to the request signal from the server200, and output the short clip based on the received information. Here,the information on the short clip may be at least one of information ona short clip, a position where the short clip is stored, and informationon a time period including the keyword. For example, when a timeinterval including a keyword is received, the electronic device 100 mayreproduce and output only a time interval including a specific keywordin the content based on the time interval.

The server 200 can store information on a plurality of short clips andkeywords for a plurality of short clips. Specifically, the server 200can receive the content from a content provider 300 and generate a shortclip from the received content. For example, the server 200 may receivebroadcast content from a broadcaster and may generate a plurality ofshort clips from the received broadcast content. Hereinafter, forconvenience of description, the content received from the contentprovider 300 is referred to as an original content.

A short clip refers to an image obtained by editing a specific part or apart of a received original content, and in some cases, a plurality ofcontents may be combined. For example, a specific portion or some ofeach of the plurality of contents may be obtained, and the obtainedportions may be combined to generate a short clip.

According to an exemplary embodiment of the present disclosure, theserver 200 may analyze the audio signal of the original content and editthe original content in units of Endpoint Detection (EPD). Here, the EPDmeans an algorithm for analyzing the audio signal of the originalcontent and detecting the start and end points of the audio in realtime.

In addition, the server 200 can acquire keywords by analyzing the voiceincluded in each of the edited images in units of EPD. Accordingly, theserver 200 can acquire and store keywords matched to each of a pluralityof edited videos and a plurality of videos edited in units of EPD fromone original content. Here, the keyword matched to the edited video maybe at least one.

According to an embodiment of the present disclosure, when the server200 analyzes the audio signal included in the edited video to acquire aplurality of keywords, a plurality of keywords may be matched to oneedited video and stored in the server. In the meantime, the originalcontent is not necessarily edited in EPD units, and the server 200 mayedit the original content based on various voice detection algorithms togenerate a plurality of short clips. The method of generating keywordsfor the short clip and the short clip of the server 200 will bedescribed in detail with reference to FIG. 3. Hereinafter, forconvenience of description, an edited video obtained from the originalcontent is named as a short clip.

The short clip may be a video in which a specific part of the originalcontent, for example, a part including a specific keyword is editedwithin a predetermined time (for example, within three minutes).However, it is needless to say that the short clip is not limited to thevideo content but can be generated by editing the sound content.Furthermore, since the playback time of the short clip can be changedaccording to the setting and voice detection algorithm, it is notlimited within three minutes.

In the meantime, the server 200 may generate and store information abouta short clip at the time of generating a short clip. Here, theinformation on the short clip may include at least one of a positionwhere the short clip is stored and a time interval including thespecific keyword. Specifically, the server 200 may acquire a keyword byanalyzing an audio signal included in the short clip, and may store akeyword matching the short clip and the short clip. Accordingly, theserver 200 can store a plurality of short clips and a plurality of shortclip-specific keywords. In addition, the server 200 according to anembodiment of the present disclosure may store the title, the genre, thebroadcast time of the original content, the generation time of the shortclip, and the broadcast station information of the original content onthe basis of the metadata of the original content along with the shortclip.

The electronic device 100 according to an exemplary embodiment of thepresent disclosure analyzes a uttered voice of a user and transmits ashort clip request signal related to a keyword included in a utteredvoice to a server, and the server 200 may transmit a short clipregarding the keyword included in the received request signal to theelectronic device 100. In addition, the electronic device 100 maydisplay the received short clip and provide it to the user.

Meanwhile, as described above, the electronic device 100 according to anembodiment of the present disclosure may transmit a user's uttered voiceto a voice recognition server and receive a keyword included in theuttered voice from the voice recognition server. It is also possiblethat the server 200 providing the short clip receives the voice utteredby the user and transmits the keyword included in the uttered voice tothe electronic device 100. That is, the voice recognition server orserver 200 may be implemented to perform speech recognition in theprocess of converting a received voice into text and acquiring a keywordin the converted text, when an uttered voice of the user is received.

Hereinbelow, referring to the block diagram illustrating the specificconfiguration of the electronic device 100 will be described regardingthe exemplary embodiment of the present disclosure.

FIGS. 2A and 2B are block diagrams illustrating a configuration of anelectronic device according to an exemplary embodiment.

Referring to FIG. 2A, the electronic device 100 includes thecommunicator 110, the inputter 120, the outputter 130, and the processor140.

The communicator 110 performs communication with an external deviceaccording to various types of communication methods.

In particular, the communicator 110 can communicate with the server 200that stores keywords for a plurality of short clips and a plurality ofshort clips using at least one of a wired/wireless scheme. Also, thecommunicator 110 can perform communication with the voice recognitionserver. Here, the communicator 110 may include various communicationchips such as a chip, a Bluetooth chip, a wireless communication chip,and an NFC chip.

As will be described later, the communicator 110 can transmit an utteredvoice received by the voice recognition server and receive a keywordincluded in the uttered voice when the user's uttered voice is receivedthrough the inputter 120. Meanwhile, when the server 200 is used as avoice recognition server, the communicator 110 may transmit the receiveduttered voice to the server 200 and receive the keyword from the server200. However, the present disclosure is not limited thereto, and it isneedless to say that the electronic device 100 may acquire a keyword byperforming voice recognition of a user's uttered voice, withoutperforming communication with the voice recognition server or the server200.

Meanwhile, the communicator 110 according to an exemplary embodiment ofthe present disclosure may transmit a signal requesting a short clip tothe server 200, and may receive a short clip according to a requestsignal from the server 200. Here, the request signal is a signal basedon information on keywords and contents included in the uttered voice ofthe user. In one example, the request signal may be a signal including akeyword and information about the content that the electronic device 100is outputting. As another example, the request signal may be sent to theserver 200 either in succession or concurrently with the keyword and aseparate signal comprising information about the content being output bythe electronic device 100.

The request signal according to an exemplary embodiment of the presentdisclosure may be a signal including information on contents displayedon the electronic device 100, keywords repeatedly output from thecontents, information on the user of the electronic device 100. Here,the keyword repeatedly output from the content may be a keyword repeatedfor a predetermined number of times or more within a predeterminedperiod of time in the content output by the electronic device 100.Hereinafter, for convenience of description, the content displayed onthe electronic device 100 or the content being output is referred to asan output content.

In addition, the communicator 110 may receive a short clip from theserver 200 as a response to the above-described request signal.

Specifically, when receiving a short clip request signal from theelectronic device 100, the server 200 may transmit a short clipcorresponding to the request signal to the electronic device 100.However, the present disclosure is not limited thereto, and the server200 may store information on a location where the original contentcorresponding to the request signal is stored and time informationcorresponding to a short clip among the original contents. For example,the server 200 may transmit the web address for reproducing the originalcontent and the time information corresponding to the short clip amongthe original contents to the electronic device 100. In this case, theelectronic device 100 can access the server where the original contentis stored based on the received web address, and can reproduce thesection corresponding to the time information.

For example, the electronic device 100 may receive the web address fromwhich the specific content is received from the server 200 and the timeinformation on the section including the keyword in the specificcontent. In this case, the electronic device 100 accesses the receivedweb address to receive a specific content, and can reproduce and outputonly a specific section of the specific content based on the timeinformation.

The inputter 120 is configured to receive a user's uttered voice andconvert it into audio data. Specifically, the inputter 120 may beimplemented as a microphone and receive a user's uttered voice. However,the present disclosure is not limited thereto, and the inputter 120 maybe provided in a remote control device (not shown) for controlling theelectronic device 100, rather than the electronic device 100, to receivethe user's uttered voice.

If the electronic device 100 is implemented as a touch-based electronicdevice, the inputter 120 may be implemented as a touch screen having amutual layer structure with the touch pad. In this case, the inputter120 may receive the keyword input through the touch screen in additionto the uttered voice.

The outputter 130 may output at least one of various contents and ashort clip. Specifically, the outputter 130 may include at least one ofa display and a speaker. When the outputter 130 includes a display, theoutputter 130 may display various content playback screens such asimages, moving pictures, texts and music, application execution screensincluding various contents, a web browser screen, a graphic userinterface (GUI) Screen and the like can be displayed.

In this case, the display may be implemented as liquid crystal displaypanel (LCD) and organic light emitting diode (OLED) and so on, but isnot limited thereto. In addition, the display may be implemented as aflexible display and a transparent display according to cases.

In particular, the display may display a short clip received from theserver 200.

Meanwhile, when the outputter 130 according to another embodiment of thepresent disclosure includes only a speaker, the outputter 130 mayprovide the received short clip as audio through a speaker. For example,when the electronic device 100 is implemented as an audio output devicethat does not have a display function, the outputter 130 may provideadditional information about the received short clip as audio, andprovide only an audio signal of the short clip.

The processor 140 controls overall operations of the electronic device100.

In particular, when the uttered voice of the user is received throughthe inputter 120, the processor 140 may transmit a signal requesting ashort clip based on the keyword and contents information included in thereceived uttered voice to the server 200 through the communicator 110.In addition, the short clip received from the server 200 according tothe request signal can be output through the outputter 130.

To be specific, the processor 140 may send information about the outputcontent to the server 200. Here, the information on the output contentinclude the title, genre, broadcast time, broadcasting stationinformation, and the like of the output content. Thus, if the processor140 sends a short clip request signal to the server 200 based on atleast one of the keyword and the information about the output content,the processor 140 may receive and provide a short clip associated withthe keyword and output content.

In this case, at the time when the processor 140 transmits the shortclip request signal to the server 200, the short clip generated by theserver 200 can be received. Here, the generated short clip may be ashort clip generated from a content different from the output content.For example, it may be broadcast content that was broadcast before theoutput content broadcast time.

However, the present disclosure is not limited to this, and a short clipgenerated from the output content can also be received. According to anexemplary embodiment, when the output content is broadcast content thatis received in real time, the server 200 may also receive the broadcastcontent. When the processor 140 transmits a request signal, if a shortclip regarding the output content is created, the corresponding shortclip can also be targeted. For example, if broadcast start timing isearlier than predetermined time than the timing of requesting a shortclip, the short clip regarding the output content can be pre-generated.

In the meantime, the processor 140 may receive additional informationabout the short clip. Specifically, the processor 140 may receive andprovide additional information about the short clip and the short clipfrom the server 200. Here, the additional information on the short clipmay be information including at least one of a title, a genre of theoriginal content of the short clip, a broadcast time of the originalcontent, a generation time of the short clip, a broadcast station of theoriginal content.

In addition, the processor 140 may analyze the audio signal of theoutput content and transmit a signal requesting a short clip related tothe keyword to the server 200 based on a keyword repeated apredetermined number of times or more for a predetermined time.Accordingly, the processor 140 can acquire a repeated word in the outputcontent as a keyword, and transmit the keyword to the server 200 toreceive the associated short clip.

Also, the electronic device 100 may include a storage (not shown) forstoring user information, and the processor 140 may transmit a requestsignal including the user information stored in the storage to theserver 200. In this case, the processor 140 may receive and display ashort clip associated with the user information. Here, the userinformation may be information on the user of the electronic device 100,including information such as age range, preferred genre, favoritecontent, favorite broadcasting station, and the like. Therefore, whenthe electronic device 100 receives a plurality of short clips from theserver 200, it can receive and display a more suitable short clip to theuser based on the keyword and user information.

FIG. 2B is a block diagram illustrating a detailed configuration of theelectronic device 100 according to another embodiment of the presentdisclosure. According to FIG. 2B, the electronic device 100 includes acommunicator 110, an inputter 120, an outputter 130, a processor 140, astorage 150, an audio processor 160, and a video processor 170. Thedetailed description of the components shown in FIG. 2B that are thesame as those shown in FIG. 2A will be omitted.

The processor 140 controls the overall operation of the electronicdevice 100 using various programs stored in the storage 150. Theprocessor 140 may include one or more of a central processor (CPU), acontroller, an application processor (AP), a communication processor(CP), an ARM processor, or may be defined by the term. In addition, theprocessor 140 may be implemented as a digital signal processor (DSP), aSoC with a content processing algorithm embedded therein, or a FieldProgrammable Gate Array (FPGA).

To be specific, the processor 140 includes RAM 141, ROM 142, main CPU143, graphic processor 144, the first to nth interfaces 145-1˜145-n, andbus 146.

The RAM 141, the ROM 142, the main CPU 143, the graphic processor 144,and the first to nth interface 145-1˜145-n can be interconnected throughthe bus 136, etc.

The first to nth interfaces 145-1 to 145-n are connected to theaforementioned various elements. One of the interfaces can be networkinterface that is connected to an external device through network.

The main CPU 143 access the storage 150 and performs booting using theO/S stored in the storage 150. By using various programs, contents, anddata stored in the storage 150, various operations are performed.

The ROM 142 stores a command set for system booting. If power issupplied as the turn-on command is input, the main CPU 143 copies theO/S stored in the storage 150 to the RAM 141 according to the commandstored in the ROM 142, executes the O/S to boot the system. If thebooting is completed, the main CPU 143 copies various applicationprograms stored in the storage 150 to the RAM 141 and executes theapplication program copied to the RAM 141 to perform an operation.

The graphic processor 144 generates a screen including various objectssuch as an icon, an image, and a text using an operation unit (notshown) and a rendering unit (not shown). An operation unit (not shown)calculates an attribute value such as a coordinate value, a shape, asize, and a color to be displayed by each object according to the layoutof the screen based on the received control command. The rendering unit(not shown) generates screens of various layouts including the objectsbased on the attribute values calculated by the operation unit (notshown). The screen generated in the rendering unit (not shown) isdisplayed in the display area of the outputter 130.

The storage 150 stores various data such as the O/S software module todrive the electronic device 100, various multimedia contents,applications, and various contents which are input or set duringexecution of applications. In particular, the storage 150 may store, forexample, user preference information, age group, user profileinformation, and so on.

The audio processor 160 is an element to perform processing of audiodata. The audio processor 160 may perform decoding, amplification, andnoise filtering with respect to audio data. For example, the audioprocessor 160 may generate and provide the feedback sound whichcorresponds to a case in which user preference information displayedduring channel zapping satisfies a preset criteria, and so on.

The video processor 170 is an element to perform processing of videodata. The video processor 170 may perform various image processing suchas decoding, scaling, noise filtering, frame rate conversion, andresolution conversion with respect to video data.

FIG. 3 is a block diagram illustrating a configuration of a server 200according to an exemplary embodiment.

Referring to FIG. 3, the server 200 includes a communicator 210, astorage 220, and a processor 230.

The communicator 210 performs communication with an external device inaccordance with various types of communication methods.

In particular, the communicator 210 may perform communication with thecontent provider 300 using at least one of a wired/wireless method.Specifically, the communicator 210 can receive the content from thecontent provider 300. Here, the communicator 210 may include variouscommunication chips such as a Wi-Fi chip, a Bluetooth chip, a wirelesscommunication chip, an NFC chip, and a tuner.

The communicator 210 according to an exemplary embodiment may performcommunication with the electronic device 100. To be specific, thecommunicator 210 may receive a short clip request signal transmittedfrom the electronic device 100 and transmit the short clip to theelectronic device 100.

The storage 220 stores the various data such as the O/S software moduleto drive the server 200, various multimedia contents, applications, andvarious contents which are input or set during execution ofapplications.

In particular, the storage 200 may store an original content, aplurality of short clips generated from the original content andkeywords by a plurality of short clips.

According to an embodiment of the present disclosure, when the server200 generates a plurality of short clips by editing the originalcontent, at least one keyword may be obtained according to the audiosignals included in the plurality of short clips. In this case, theserver 200 may store the short clips and the keywords obtained from theshort clips in the storage 220. For example, when the audio signalincluded in the first short clip is analyzed to obtain the first andsecond keywords, the server 200 may store the first and second keywordstogether with the first short clip.

According to an embodiment of the present disclosure, the server 200 maygroup and store short clips for each keyword. In this case, short clipsincluding the audio signal corresponding to the first keyword may begrouped and stored in the storage 220. Accordingly, if the first keywordis included in the short clip request signal received from theelectronic device 100, the server 200 can transmit a plurality of shortclips grouped corresponding to the first keyword to the electronicdevice 100.

The processor 230 controls overall operations of the server 200.

When the server 200 performs a voice recognition function according toan exemplary embodiment, the processor 230 may analyze the uttered voicereceived from the electronic device 100 and obtain a keyword included inthe uttered voice. The server 200 may transmit a keyword to theelectronic device 100.

In addition, when the original content is received through thecommunicator 210, the processor 230 may edit the received originalcontent to generate a plurality of short clips. Specifically, theprocessor 230 may edit only a specific section of the original contentbased on the speech detection algorithm. Here, the voice detectionalgorithm means an algorithm for detecting an audio signal including atleast one keyword.

For example, the processor 230 may analyze an audio signal of theoriginal content to detect a starting point and an ending point of voiceand edit a section between the start point and end point (EPD unit) togenerate a short clip.

However, the present disclosure is not limited thereto, and the server200 may be configured to transmit the short clip request signal based ona preset time interval, a specific interval set by the content provider,a time interval set by the administrator of the server 200, and edit theoriginal content to create a short clip.

According to an exemplary embodiment, when the processor 230 determinesthat the voice is terminated after the first detection of a voice in thebroadcast content received in real time, the processor 230 may edit thecorresponding section in real time to generate a short clip. In thiscase, the processor 230 may determine that the voice has been terminatedif no voice is detected for a predetermined time or if a mechanicalnoise, noise, or the like is detected for a predetermined time or more.The processor 230 may then store the generated short clip and theobtained keyword in the storage 220 together. Thus, the processor 230may send a short clip to the electronic device 100 in response to ashort clip request signal received from the electronic device 100.

Meanwhile, the server 200 according to an embodiment of the presentdisclosure may store the web address that can receive the originalcontent and the time information about the section including thespecific keyword in the database, instead of generating the short clipfrom the original content. In this case, when a short clip requestsignal is received from the electronic device 100, the server 200 maytransmit web address for receiving an original content corresponding tothe short clip request signal and section information including aspecific keyword from the original content to the electronic device 100.Accordingly, the electronic device 100 may, instead of receiving a shortclip from the server 200, provide a short clip through a method ofoutputting a section which includes a specific keyword from an originalcontent based on web address and time information.

Hereinbelow, a short clip provision according to various exemplaryembodiments of the present invention will be described. In the meantime,for convenience of description, FIGS. 4-8 will describe an exemplaryembodiment in which the electronic device 100 is implemented to includea display, and an output content and a short clip are output through adisplay.

FIG. 4 is a view to describe a method for outputting a short cliprelating to a keyword according to an exemplary embodiment.

According to FIG. 4, the electronic device 100 may receive the utteredvoice of the user. In this case, the electronic device 100 can analyzethe voice uttered by the user and acquire the keyword included in theuttered voice. For example, if the received uttered voice of the user is“tell me the current traffic information”, the electronic device 100 canacquire “traffic information” as a keyword. Meanwhile, it is needless tosay that the electronic device 100 according to another embodiment ofthe present disclosure may acquire the keyword included in the utteredvoice by performing communication with the voice recognition server orthe server 200.

The electronic device 100 may also send a signal to the server 200requesting a short clip for the acquired keyword. In this case, theserver 200 may send a short clip for the keyword to the electronicdevice 100. Specifically, the server 200 may transmit a specific shortclip to the electronic device 100 based on short clip and short clipspecific keywords generated from the original content until receivingthe request signal from the electronic device 100. For example, if thekeyword included in the short clip request signal is “trafficinformation”, the server 200 transmits only a short clip having thekeyword “traffic information” to the electronic device 100. In thiscase, the electronic device 100 can receive a short clip having thekeyword “traffic information” generated by editing a specific section ofa news program transmitted from a content provider, that is, abroadcaster. Accordingly, the received short clip may be video contentincluding an audio signal corresponding to “traffic information”.

Meanwhile, according to an exemplary embodiment of the presentdisclosure, the electronic device 100 may transmit a short clip requestsignal including user information to the server 200. In this case, theserver 200 may transmit a short clip related to the keyword and the userinformation to the electronic device 100. For example, when the locationof the electronic device 100 corresponds to “Seoul” according to theuser information, the server 200 selects “traffic information” and“Seoul” from among a plurality of short clips having “trafficinformation” as a keyword can transmit a short clip satisfying both“traffic information” and “Seoul” to the electronic device 100. Thus,the electronic device 100 can display a short clip optimized for theuser among the short clips generated in real time.

Meanwhile, according to an embodiment of the present disclosure, theelectronic device 100 may provide an output mode and a short clip mode.The output mode may be a mode for continuously outputting only theoutput content regardless of whether or not short clip reception isperformed from the server 200. In addition, the short clip mode may be amode for displaying a short clip received from the server 200. Theelectronic device 100 can display the short clip by switching from theoutput mode to the short clip mode at the end of the output content (forexample, during CF broadcast). In the meantime, the present disclosureis not limited thereto, and switching between the output mode and theshort clip mode may be performed in response to user input. For example,when a user's uttered voice is received in the output mode, the user canautomatically switch to the short clip mode and display the short clipreceived from the server 200. Further, the output mode and the shortclip mode may be simultaneously executed. For example, when a short clipis received from the server 200, the received short clip may overlapwith the output content and be displayed in a partial area of theoutputter 130.

Hereinbelow, a method for receiving a short clip based on an outputcontent will be described.

FIG. 5 is a view to describe a method for outputting a short cliprelating to an output content according to an exemplary embodiment.

According to FIG. 5, the electronic device 100 may transmit informationabout the output content to the server 200 in addition to the keywordobtained from the user's uttered voice, in the short clip requestsignal. In this case, the server 200 may transmit a specific short clipto the electronic device 100 based on the keyword and the short cliprequest signal.

More specifically, the information on the output content meansinformation on the content being output to the electronic device 100,and can he obtained from the metadata on the output content. Forexample, the information on the output content may include the title,genre, broadcast time, broadcast station information, and the like ofthe output content. However, the present disclosure is not limitedthereto, and the information about the content can be acquired throughvarious methods. For example, it is possible to receive informationabout a content from an external server, perform OCR on the screen, andobtain additional information.

As shown in FIG. 5, if the user's uttered voice is “Please inform me ofa batter of Team A”, the electronic device 100 can acquire at least oneof “Team A” and “batter” as keywords. In addition, if the output contentis a baseball game, the electronic device 100 may include information(e.g., “sports”, “baseball”) and keywords (e.g, “Team A” and ‘batter”)to the server 200 as a short clip request signal. In this case, theserver 200 can transmit, to the electronic device 100, a short clipkeyword of “sports”, “baseball”, “Team A” and “batter” among a pluralityof short clips. Accordingly, the electronic device 100 can receive anddisplay an interview image of the batter of Team A, sports news of TeamA, and the like from the server 200. Meanwhile, as described above, theplurality of short clips received by the electronic device 100 may bevideo content generated by editing the specific section of the originalcontent transmitted from the broadcasting company and received by theserver 200.

Hereinbelow, a method of acquiring a keyword from an audio signal whichis output from the electronic device 100 and receiving a short clip withrespect to the acquired keyword will be described.

FIG. 6 is a view to describe a method for acquiring a keyword byanalyzing an audio signal according to an exemplary embodiment.

According to FIG. 6, a case in which the content which is output fromthe electronic device 100 is repeatedly output can be assumed. In thiscase, the electronic device 100 may, in addition to the keyword acquiredform the uttered voice of the user, include the words repeatedly outputfrom the output content to the short clip request signal and transmitthe same to the server 200.

To be specific, the electronic device 100 may transmit a keyword whichis repeated for more than a predetermined time for predetermined timefrom audio output through the speaker provided in the electronic device100 to the server 200.

For example, if the output content is a travel information program for“Spain”, the electronic device 100 can analyze the audio signal of theoutput content and obtain “Spain”, “Barcelona”. In this case, the server200 can transmit short clips matching “Spain” and “Barcelona” among theplurality of short clips to the electronic device 100. Accordingly, theelectronic device 100 can receive a short clip for “Spain” and“Barcelona” from the server 200 and display the same. Meanwhile, asdescribed above, the electronic device 100 may transmit the short cliprequest information to the server 200 including the information aboutthe output content. In this case, the electronic device 100 may receivea short clip generated by editing a specific section of the travelinformation program for “Spain” and “Barcelona”.

In the meantime, the electronic device 100 according to an exemplaryembodiment may display the short clip received from the server 200 as athumbnail image. In this case, a short clip which corresponds to athumbnail image according to a user's input can be reproduced.

Hereinbelow, a specific method of displaying a short slip on theelectronic device 100 will be described.

FIG. 7 is a view to describe additional information regarding a shortclip according to an exemplary embodiment.

According to FIG. 7, the electronic device 100 may additionally receiveinformation regarding a short clip from the server 200 and provide thereceived information along with the short clip.

More specifically, the additional information about the short clipincludes at least one of the title 710 of the original content, thegenre, the broadcast time 720 of the original content, the broadcaststation information 730 of the original content, the generation time ofthe short clip. Here, the broadcast time of the original content maymean a time when the server 200 received the content from the contentprovider 300, a generation time of the original content, a time when theoriginal content was transmitted from the broadcasting station, and thelike. In addition, the keyword of the short clip may mean a keyword thatmatches the keyword included in the short clip request signal among atleast one keyword matched to the short clip.

As shown in FIG. 7, the additional information on the short clip can bedisplayed when the short clip selected in accordance with the user'sinput is reproduced. However, the present disclosure is not limitedthereto, and it is needless to say that the electronic device 100 maydisplay a plurality of short clips received from the server 200 as athumbnail image and simultaneously display additional information aboutthe short clip.

Hereinbelow, a method for displaying additional response informationwith respect to a keyword included in the uttered voice of a user willbe described.

FIG. 8 is a view to describe additional response information which isprovided along with a short clip according to an exemplary embodiment.

Referring to FIG. 8, the electronic device 100 can receive, from anexternal server, additional response information for a keyword obtainedfrom a user's uttered voice and display it together with a short clip.Here, the additional response information may include a search result810 for the keyword, information about the keyword, and the like.However, the present disclosure is not limited thereto, and it ispossible to receive and display additional response information on atleast one of the information on the output content, the userinformation, and the repeated keywords in the output content from theexternal server.

For example, a search result that a genre of an output content is akeyword can be received from an external server and displayed along witha short clip. In addition, the search result that a keyword obtainedfrom the uttered voice of a user and a genre of the output content arecombined can be received from an external server and displayed.

FIG. 9 is a flowchart to describe a method for providing a short clipaccording to an exemplary embodiment.

According to a control method of the electronic device as illustrated inFIG. 9, a content is output first (S910).

Then, an uttered voice of a user is received (S920).

Then, when an uttered voice is received, a short clip request signal istransmitted to the server based on the keyword and contents informationincluded in the received uttered voice (S930).

A short clip is output based on information regarding the short clipreceived from the server according to a request signal (S940).

Herein, the information on the short clip includes at least one of aposition where the short clip is stored and a time section including thekeyword. In step S940, when information on the short clip is receivedfrom the server according to the request signal, it is possible tooutput a short clip based on the information.

In addition, each of a plurality of short clips can be a video contentor an audio content which is generated by editing a part which includesa specific keyword from a specific content.

Further, in step S940, when additional information on the short clip isreceived, additional information on the short clip is provided. Here,the information on the short clip may include the title of the originalcontent, the genre, the broadcast time of the original content,broadcasting station information of an original content, and a keyword.

In addition, in S940, additional information regarding a short clip canbe provided as audio through a speaker.

In addition, the electronic device may include at least one of a displayand a speaker. In step S930, a signal requesting a short clip related toa keyword can be additionally transmitted to a server based on a keywordwhich is repeated for more than a predetermined times for predeterminedtime from audio which is output through a speaker.

In S940, based on the keyword included in the received uttered voice,additional response information regarding the uttered voice can beprovided along with the short clip.

In step S930, a request signal including a keyword and user informationis transmitted to the server. In step S940, a short clip related to thekeyword and the regular user information may be received from the serverand output.

In step S930, the received uttered voice can be transmitted to the voicerecognition server or the aforementioned server, and the short cliprequest signal may be transmitted to the server based on the keyword andthe information on the content received from the voice recognitionserver or the server.

FIG. 10 is a flowchart to describe a system for providing a short clipaccording to an exemplary embodiment.

According to FIG. 10, first, the server 200 receives contents from thecontents provider 300 (S1010). Hereinafter, the content received fromthe content provider 300 is referred to as an original content. In themeantime, the server 200 can receive the content from the contentprovider 300 in real time. If the content provider 300 is a broadcastingstation, the server 200 can receive the broadcasting program transmittedin real time from the broadcasting station as original contents.

Then, the server 200, based on keywords of the received originalcontents, generates a plurality of short clips (S1020).

The server 200 stores the generated plurality of short clips andkeywords by short clips (S1030).

In the meantime, the electronic device 100 receives user's uttered voice(S1040).

Then, a short clip request signal related to a keyword included in thereceived uttered voice is transmitted to the server 200 (S1050).

The electronic device 100 receives a short clip from a server (S1060).

Then, the electronic device 100 outputs the received short clip (S1070).

Hereinbelow, when the electronic device 100 does not have a displayfunction, a method for providing a short clip through a speaker will bedescribed.

FIG. 11 is a view to describe a method that an electronic deviceprovides a short clip through a speaker according to another exemplaryembodiment.

According to FIG. 11, the electronic device 100 may be provided with aspeaker as an outputter, and no display may be provided. In this case,the electronic device 100 can output an audio signal of a short clipfrom the server 200 and provide the same. As an example, if the shortclip contains both the video signal and the audio signal as videocontent, the electronic device 100 may provide only the audio signal inthe received short clip.

As shown in FIG. 11, when “please inform current weather” is received asa uttered voice, a short clip having “current weather” as a keyword canbe provided. In this case, the location information of the electronicdevice 100 may be additionally received as described above to provide ashort clip for the current weather in the specific area (e.g., currentweather in New York). In addition, since the electronic device 100 maynot have a display, only the audio signal of the received short clip maybe output.

Further, when the additional information on the short clip is receivedas described above, additional information on the short clip can beconverted into an audio signal and provided. For example, whenadditional information about the short clip and the short clip isreceived from the server 200, additional information on the short clipmay be output first, and the audio signal included in the short clip maybe sequentially output.

The electronic device 100 according to an embodiment of the presentinvention may output only a part of the additional information about thereceived short clip as audio. For example, when the title, genre, andbroadcast time of the original content are received as additionalinformation about the short clip, the electronic device 100 providesonly the title of the original content as an audio signal, and outputthe audio signal of the received short clip.

In addition, the electronic device 100 according to an embodiment of thepresent invention can sequentially provide a plurality of short clipsbased on a predetermined priority when a plurality of short clips arereceived from the server 200. For example, the electronic device 100 canoutput audio signals included in a plurality of short clips through aspeaker in accordance with a generation sequence of a short clip.

Therefore, a user, even the electronic device does not have a displayfunction, may receive a short clip and additional information regardingthe short clip as an audio signal.

The methods according to various exemplary embodiments can beimplemented as software installable in the conventional electronicdevice and server, program, or applications.

In addition, the methods according to the exemplary embodiments can beimplemented only by software upgrade of the conventional electronicdevice or server, or hardware upgrade.

Meanwhile, the method of controlling an electronic device according tovarious embodiments of the present invention described above may beimplemented as a computer-executable program code and provided to eachserver or device such that it is stored in the non-transitory computerreadable medium and executed by the processor.

In addition, the method of controlling an electronic device according tovarious embodiments of the present invention described above may beimplemented as a computer program product including a computer readablemedium including a computer readable program executed by a computerdevice. The computer-readable program may also be stored in acomputer-readable storage medium at a server, and the program may beembodied in a form downloadable via a network to a computer device.

A non-transitory readable medium is not a medium for storing data for ashort time such as a register, a cache, a memory, etc., but means amedium that semi-permanently stores data and can be read by a device. Inparticular, the various applications or programs described above may bestored and provided on non-volatile readable media such as CD, DVD, harddisk, Blu-ray disk, USB, memory card, ROM.

The foregoing exemplary embodiments and advantages are merely exemplaryand are not to be construed as limiting the exemplary embodiments. Thepresent teaching can be readily applied to other types of apparatuses.Also, the description of the exemplary embodiments of the presentinventive concept is intended to be illustrative, and not to limit thescope of the claims.

What is claimed is:
 1. An electronic device comprising: a communicatorconfigured to communicate with a server for storing information on aplurality of short clips and keywords by the plurality of short clips;an outputter; an inputter; and a processor configured to: in response toa user's uttered voice being received through the inputter, transmit ashort clip request signal to the server based on a keyword included inthe received uttered voice and information on a content output from theoutputter, and output a short clip through the outputter based oninformation on a short clip received from the server according to therequest signal.
 2. The electronic device of claim 1, wherein informationon the plurality of short clips includes at least one of information ona position in which the plurality of short clips are stored and timesection including the keyword, and the processor, in response toinformation regarding a short clip being received from the serveraccording to the request signal, outputs the short clip based on thereceived information.
 3. The electronic device of claim 1, wherein eachof the plurality of short clips is a video content or an audio contentwhich is generated by editing a portion including a specific keywordfrom a specific content.
 4. The electronic device of claim 1, whereinthe processor, in response to additional information regarding the shortclip being received, provides additional information regarding the shortclip, and the additional information regarding the short clip includesat least one of a title and a genre of an original content, broadcasttime of the original content, generation time of the short clip,broadcaster information of the original content, and the keyword.
 5. Theelectronic device of claim 1, wherein the outputter comprises at leastone of a display and a speaker.
 6. The electronic device of claim 4,wherein the outputter is configured to include only a speaker, and theprocessor provides additional information on the short clip as an audiothrough the speaker.
 7. The electronic device of claim 1, wherein theoutputter comprises at least one of a display and a speaker, and theprocessor additionally transmits a short clip request signal related tothe keyword to the server based on a keyword which is repeated for apredetermined times or more during a predetermined time from audio thatis output through the speaker.
 8. The electronic device of claim 1,wherein the processor provides additional response information on theuttered voice along with the short clip based on a keyword included inthe received uttered voice.
 9. The electronic device of claim 1, whereinthe processor transmits the request signal which includes the keywordand user information to the server and receives a short clip relating tothe keyword and the user information from the server.
 10. The electronicdevice of claim 1, wherein the processor, in response to the utteredvoice being received, transmits the received uttered voice to the voicerecognition server or the server, and transmits a short clip requestsignal to the server based on the keyword and information on the contentreceived from the voice recognition server or the server.
 11. Acontrolling method of an electronic device which communicates with aserver for storing information on a plurality of short clips andkeywords by the plurality of short clips, the method comprising:outputting a content; receiving a user's uttered voice; in response tothe uttered voice being received, transmitting a short clip requestsignal to the server based on a keyword included in the received utteredvoice and information on a content output from the outputter; andoutputting a short clip through the outputter based on information on ashort clip received from the server according to the request signal. 12.The method of claim 11, wherein information on the plurality of shortclips includes at least one of information on a position in which theplurality of short clips are stored and time section including thekeyword, and wherein the outputting comprises, in response toinformation regarding a short clip being received from the serveraccording to the request signal, outputting the short clip based on thereceived information.
 13. The method of claim 11, wherein each of theplurality of short clips is a video content or an audio content which isgenerated by editing a portion including a specific keyword from aspecific content.
 14. The method of claim 11, wherein, in response toadditional information regarding the short clip being received,providing additional information regarding the short clip, and theadditional information regarding the short clip includes at least one ofa title and a genre of an original content, broadcast time of theoriginal content, generation time of the short clip, broadcasterinformation of the original content, and the keyword.
 15. The method ofclaim 14, wherein the outputting the short clip comprises providingadditional information on the short clip as an audio through thespeaker.